Improving Search Engines via Classification
نویسنده
چکیده
In this dissertation, we study the problem of how search engines can be improved by making use of classification. Given a user query, traditional search engines output a list of results that are ranked according to their relevance to the query. However, the ranking is independent of the topic of the document. So the results of different topics are not grouped together within the result output from a search engine. This can be problematic as the user must scroll though many irrelevant results until his/her desired information need is found. This might arise when the user is a novice or has superficial knowledge about the domain of interest, but more typically it is due to the query being short and ambiguous. One solution is to organise search results via categorization, in particular, the classification. We designed a target testing experiment on a controlled data set, which showed that classification-based search could improve the user’s search experience in terms of the numbers of results the user would have to inspect before satisfying his/her query. In our investigation of classification to organise search results, we not only consider the classification of search results, but also query classification. In particular, we investigate the case where the enrichment of the training and test queries is asymmetrical. We also make use of a large search engine log to provide a comprehensive topic specific analysis of search engine queries. Finally we study the problem of ranking the classes using some new features derived from the class. The contribution of this thesis is the investigation of classification-based search in terms of ranking the search results. This allows us to analyze the
منابع مشابه
Discovering Popular Clicks\' Pattern of Teen Users for Query Recommendation
Search engines are still the most important gates for information search in internet. In this regard, providing the best response in the shortest time possible to the user's request is still desired. Normally, search engines are designed for adults and few policies have been employed considering teen users. Teen users are more biased in clicking the results list than are adult users. This leads...
متن کاملA Technique for Improving Web Mining using Enhanced Genetic Algorithm
World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines used conventional methods to retrieve information on the Web; however, the search results of these engines are still able to be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms which search according to the user interests...
متن کاملThe Core Aspects of Search Engine Optimisation Necessary to Move up the Ranking
Search engine optimization (SEO) is the process of improving the visibility, volume and quality of traffic to website or a web page in search engines via the natural search results. SEO can also target other areas of a search, including image search and local search. SEO is one of many different strategies used for marketing a website but SEO has been proven the most effective. An Internet mark...
متن کاملImproving the Performance and Precision of Bioinformatics Algorithms
Title of dissertation: Improving the Performance and Precision of Bioinformatics Algorithms Xue Wu, Doctor of Philosophy, 2008 Dissertation directed by: Professor Chau-Wen Tseng Department of Computer Science Recent advances in biotechnology have enabled scientists to generate and collect huge amounts of biological experimental data. Software tools for analyzing both genomic (DNA) and proteomic...
متن کاملEvaluation of Web Search Engines with Thai Queries
This paper discusses some challenging issues that are found in the evaluation of web search engines by using Thai queries. The discussions are based on our experience in evaluating and comparing the search performance of 7 search engines on Thai queries. The issues addressed in this paper will help in improving further evaluations of search engines for Thai.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011